首页> 外文OA文献 >All in a twitter: Self-tuning strategies for a deeper understanding of a crisis tweet collection
【2h】

All in a twitter: Self-tuning strategies for a deeper understanding of a crisis tweet collection

机译:推特上的一切:自我调整策略,可以更深入地了解危机鸣叫集

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Natural disasters have become more frequent during the past 20 years due to significant climate changes. These natural events are hotly debated on social networks like Twitter and a huge amount of short text messages are continuously and promptly exchanged with personal opinions, descriptions of the natural events and their corresponding consequences. The analysis of these large and complex data could help policy-makers to better understand the event as well as to set priorities. However, the correct configuration of the tweet mining process is still challenging due to variable data distribution and the availability of a large number of algorithms with different specific parameters. The analyst need to perform a large number of experiments to identify the best configuration for the overall knowledge discovery process. Innovative, scalable, and parameter-free solutions need to be explored to streamline the analytics process. This paper presents an enhanced version of PASTA (a distributed self-tuning engine) applied to a crisis tweet collection to group a corpus of tweets into cohesive and well-separated clusters with minimal analyst intervention. Experimental results performed on real data collected during natural disasters show the effectiveness of PASTA in discovering interesting groups of correlated tweets without selecting neither the algorithms nor their parameters.
机译:在过去的20年中,由于重大的气候变化,自然灾害变得更加频繁。这些自然事件在诸如Twitter之类的社交网络上引起了激烈的辩论,并且大量的短消息不断地与个人意见,对自然事件的描述及其相应后果进行交换。对这些庞大而复杂的数据进行分析可以帮助决策者更好地了解事件并确定优先级。然而,由于可变的数据分布以及具有不同特定参数的大量算法的可用性,推文挖掘过程的正确配置仍然具有挑战性。分析人员需要进行大量实验,才能确定整个知识发现过程的最佳配置。需要探索创新,可扩展且无参数的解决方案,以简化分析过程。本文介绍了PASTA的增强版本(分布式自调整引擎),该增强版应用于危机鸣叫集合,以最少的分析员干预将鸣叫的主体归纳为有凝聚力且分隔良好的集群。对自然灾害期间收集的真实数据进行的实验结果表明,PASTA在发现相关推文有趣的组而不选择算法或参数的情况下是有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号